DE-net: Dynamic Text-Guided Image Editing Adversarial Networks

نویسندگان

چکیده

Text-guided image editing models have shown remarkable results. However, there remain two problems. First, they employ fixed manipulation modules for various requirements (e.g., color changing, texture content adding and removing), which results in over-editing or insufficient editing. Second, do not clearly distinguish between text-required text-irrelevant parts, leads to inaccurate To solve these limitations, we propose: (i) a Dynamic Editing Block (DEBlock) that composes different dynamically requirements. (ii) Composition Predictor (Comp-Pred), predicts the composition weights DEBlock according inference on target texts source images. (iii) text-adaptive Convolution (DCBlock) queries features parts parts. Extensive experiments demonstrate our DE-Net achieves excellent performance manipulates images more correctly accurately.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improvement of generative adversarial networks for automatic text-to-image generation

This research is related to the use of deep learning tools and image processing technology in the automatic generation of images from text. Previous researches have used one sentence to produce images. In this research, a memory-based hierarchical model is presented that uses three different descriptions that are presented in the form of sentences to produce and improve the image. The proposed ...

متن کامل

Neural Photo Editing with Introspective Adversarial Networks

The increasingly photorealistic sample quality of generative image models suggests their feasibility in applications beyond image generation. We present the Neural Photo Editor, an interface that leverages the power of generative neural networks to make large, semantically coherent changes to existing images. To tackle the challenge of achieving accurate reconstructions without loss of feature ...

متن کامل

Generative Adversarial Text to Image Synthesis

Automatic synthesis of realistic images from text would be interesting and useful, but current AI systems are still far from this goal. However, in recent years generic and powerful recurrent neural network architectures have been developed to learn discriminative text feature representations. Meanwhile, deep convolutional generative adversarial networks (GANs) have begun to generate highly com...

متن کامل

Text-image Coupling for Editing Literary Sources

Users need more sophisticated tools to handle the growing number of image-based documents available in databases. In this paper, we present a system devoted to the editing and browsing of complex literary hypermedia including original manuscript documents and other handwritten sources. Editing capabilities allow the user to transcribe manuscript images in an interactive way and to encode the re...

متن کامل

Spectral Image Visualization Using Generative Adversarial Networks

Spectral images captured by satellites and radiotelescopes are analyzed to obtain information about geological compositions distributions, distant asters as well as undersea terrain. Spectral images usually contain tens to hundreds of continuous narrow spectral bands and are widely used in various fields. But the vast majority of those image signals are beyond the visible range, which calls for...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i8.26189